WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

A61N C12Q 1/68, C12N 15/00, 
C07H 21/02, A61K 39/00 



A1



(11) International Publication Number: WO 97/47358 

(43) International Publication Date: 18 December 1997 (18.12.97)



(21) International Application Number: PCT/US97/09884 

(22) International Filing Date: 6 June 1997 (06.06.97) 



(30) Priority Data:
60/020,494    11 June 1996 (11.06.96)    US
9614731.9    12 July 1996 (12.07.96)    GB
60/033,534    20 December 1996 (20.12.96)    US



(71) Applicant (for all designated States except US): MERCK &
CO., INC. [US/US]; 126 East Lincoln Avenue, Rahway, NJ
07065 (US).

(72) Inventors; and 

(75) Inventors/Applicants (for US only): DONNELLY, John, J.
[US/US]; 126 East Lincoln Avenue, Rahway, NJ 07065
(US). LU, Tong-Ming [US/US]; 126 East Lincoln Avenue,
Rahway, NJ 07065 (US). LIU, Margaret, A. [US/US]; 126
East Lincoln Avenue, Rahway, NJ 07065 (US). SHIVER, 
John, W. [US/US]; 126 East Lincoln Avenue, Rahway, NJ 
07065 (US). 

(74) Common Representative: MERCK & CO., INC.; 126 East 
Lincoln Avenue, Rahway, NJ 07065 (US).



(81) Designated States: AL, AM, AU, AZ, BA, BB, BG, BR, BY,
CA, CN, CU, CZ, EE, GE, HU, IL, IS, JP, KG, KR, KZ,
LC, LK, LR, LT, LV, MD, MG, MK, MN, MX, NO, NZ,
PL, RO, RU, SG, SI, SK, TJ, TM, TR, TT, UA, US, UZ,
VN, YU, ARIPO patent (GH, KE, LS, MW, SD, SZ, UG),
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM),
European patent (AT, BE, CH, DE, DK, ES, FI, FR, GB,
GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ,
CF, CG, CI, CM, GA, GN, ML, MR, NE, SN, TD, TG).



Published 

With international search report 



(54) Title: SYNTHETIC HEPATITIS C GENES 
(57) Abstract 

This invention relates to novel methods and formulations of nucleic acid pharmaceutical products, specifically formulations of nucleic 
acid vaccine products and nucleic acid gene therapy products. 








TITLE OF THE INVENTION 
SYNTHETIC HEPATITIS C GENES 

CROSS-REFERENCE TO RELATED APPLICATIONS 
Not applicable.

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
Not applicable. 

REFERENCE TO MICROFICHE APPENDIX
Not applicable. 






FIELD OF THE INVENTION 
Not applicable. 



BACKGROUND OF THE INVENTION 

This invention relates to novel nucleic acid pharmaceutical 
products, specifically nucleic acid vaccine products. The nucleic acid 
vaccine products, when introduced directly into muscle cells, induce the 
production of immune responses which specifically recognize Hepatitis
C virus (HCV). 

Hepatitis C Virus 

Non-A, Non-B hepatitis (NANBH) is a transmissible disease
(or family of diseases) that is believed to be virally induced, and is
distinguishable from other forms of virus-associated liver disease, such
as those caused by hepatitis A virus (HAV), hepatitis B virus (HBV), 
delta hepatitis virus (HDV), cytomegalovirus (CMV) or Epstein-Barr 
virus (EBV). Epidemiologic evidence suggests that there may be three
types of NANBH: the water-borne epidemic type; the blood or needle
associated type; and the sporadically occurring (community acquired) 
type. However, the number of causative agents is unknown. Recently, a 
new viral species, hepatitis C virus (HCV) has been identified as the 
primary (if not only) cause of blood-associated NANBH (BB-NANBH). 
Hepatitis C appears to be the major form of transfusion-associated
hepatitis in a number of countries, including the United States and
Japan. There is also evidence implicating HCV in induction of
hepatocellular carcinoma. Thus, a need exists for an effective method
for preventing or treating HCV infection: currently, there is none.

The HCV may be distantly related to the Flaviviridae. The
Flavivirus family contains a large number of viruses which are small,
enveloped pathogens of man. The morphology and composition of
Flavivirus particles are known, and are discussed in M. A. Brinton, in
"The Viruses: The Togaviridae And Flaviviridae" (Series eds.
Fraenkel-Conrat and Wagner, vol. eds. Schlesinger and Schlesinger, Plenum
Press, 1986), pp. 327-374. Generally, with respect to morphology,
Flaviviruses contain a central nucleocapsid surrounded by a lipid
bilayer. Virions are spherical and have a diameter of about 40-50 nm.
Their cores are about 25-30 nm in diameter. Along the outer surface of
the virion envelope are projections measuring about 5-10 nm in length
with terminal knobs about 2 nm in diameter. Typical examples of the
family include Yellow Fever virus, West Nile virus, and Dengue Fever
virus. They possess positive-stranded RNA genomes (about 11,000
nucleotides) that are slightly larger than that of HCV and encode a
polyprotein precursor of about 3500 amino acids. Individual viral
proteins are cleaved from this precursor polypeptide.

The genome of HCV appears to be single-stranded RNA 
containing about 10,000 nucleotides. The genome is positive-stranded,
and possesses a continuous translational open reading frame (ORF) that
encodes a polyprotein of about 3,000 amino acids. In the ORF, the 
structural proteins appear to be encoded in approximately the first 
quarter of the N-terminal region, with the majority of the polyprotein 
attributed to non-structural proteins. When compared with all known
viral sequences, small but significant co-linear homologies are observed
with the nonstructural proteins of the Flavivirus family, and with the 
pestiviruses (which are now also considered to be part of the Flavivirus 
family). 

Intramuscular inoculation of polynucleotide constructs, i.e.,
DNA plasmids encoding proteins, has been shown to result in the in situ
generation of the protein in muscle cells. By using cDNA plasmids
encoding viral proteins, both antibody and CTL responses were
generated, providing homologous and heterologous protection against
subsequent challenge with either the homologous strain or a
heterologous strain, respectively. Each of these types of immune responses
offers a potential advantage over existing vaccination strategies. The
use of PNVs (polynucleotide vaccines) to generate antibodies may result
in an increased duration of the antibody responses as well as the
provision of an antigen that can have both the exact sequence of the
clinically circulating strain of virus and the proper post-translational
modifications and conformation of the native protein (vs. a
recombinant protein). The generation of CTL responses by this means
offers the benefits of cross-strain protection without the use of a live,
potentially pathogenic vector or attenuated virus.

Therefore, this invention contemplates methods for
introducing nucleic acids into living tissue to induce expression of
proteins. The invention provides a method for introducing viral
proteins into the antigen processing pathway to generate virus-specific
immune responses including, but not limited to, CTLs. Thus, the need
for specific therapeutic agents capable of eliciting desired prophylactic
immune responses against viral pathogens is met for HCV by this
invention. Of particular importance in this therapeutic approach is the
ability to induce T-cell immune responses which can prevent infections
even of virus strains which are heterologous to the strain from which
the antigen gene was obtained. Therefore, this invention provides DNA
constructs encoding viral proteins of the hepatitis C virus core, envelope
(E1), nonstructural (NS5) genes or any other HCV genes which encode
products which generate specific immune responses including but not
limited to CTLs.

DNA Vaccines

Benvenisty, N., and Reshef, L. [PNAS 83, 9551-9555,
(1986)] showed that CaCl2-precipitated DNA introduced into mice
intraperitoneally (i.p.), intravenously (i.v.) or intramuscularly (i.m.)
could be expressed. The i.m. injection of DNA expression vectors
without CaCl2 treatment in mice resulted in the uptake of DNA by the
muscle cells and expression of the protein encoded by the DNA. The
plasmids were maintained episomally and did not replicate.
Subsequently, persistent expression has been observed after i.m.
injection in skeletal muscle of rats, fish and primates, and cardiac
muscle of rats. The technique of using nucleic acids as therapeutic
agents was reported in WO90/11092 (4 October 1990), in which
polynucleotides were used to vaccinate vertebrates.

It is not necessary for the success of the method that
immunization be intramuscular. The introduction of gold
microprojectiles coated with DNA encoding bovine growth hormone
(BGH) into the skin of mice resulted in production of anti-BGH
antibodies in the mice. A jet injector has been used to transfect skin,
muscle, fat, and mammary tissues of living animals. Various methods
for introducing nucleic acids have been reviewed. Intravenous injection
of a DNA:cationic liposome complex in mice was shown by Zhu et al.
[Science 261:209-211 (9 July 1993)] to result in systemic expression of a
cloned transgene. Ulmer et al. [Science 259:1745-1749 (1993)]
reported on the heterologous protection against influenza virus infection
by intramuscular injection of DNA encoding influenza virus proteins.

The need for specific therapeutic and prophylactic agents
capable of eliciting desired immune responses against pathogens and
tumor antigens is met by the instant invention. Of particular
importance in this therapeutic approach is the ability to induce T-cell
immune responses which can prevent infections or disease caused even
by virus strains which are heterologous to the strain from which the
antigen gene was obtained. This is of particular concern when dealing
with HIV as this virus has been recognized to mutate rapidly and many
virulent isolates have been identified [see, for example, LaRosa et al.,
Science 249:932-935 (1990), identifying 245 separate HIV isolates]. In
response to this recognized diversity, researchers have attempted to
generate CTLs based on peptide immunization. Thus, Takahashi et al.
[Science 255:333-336 (1992)] reported on the induction of broadly
cross-reactive cytotoxic T cells recognizing an HIV envelope (gp160)
determinant. However, those workers recognized the difficulty in
achieving a truly cross-reactive CTL response and suggested that there
is a dichotomy between the priming or restimulation of T cells, which is
very stringent, and the elicitation of effector function, including
cytotoxicity, from already stimulated CTLs.

Wang et al. reported on elicitation of immune responses in
mice against HIV by intramuscular inoculation with a cloned, genomic
(unspliced) HIV gene. However, the level of immune responses
achieved in these studies was very low. In addition, the Wang et al.
DNA construct utilized an essentially genomic piece of HIV encoding
contiguous Tat/REV-gp160-Tat/REV coding sequences. As is described
in detail below, this is a suboptimal system for obtaining high-level
expression of the gp160. It also is potentially dangerous because
expression of Tat contributes to the progression of Kaposi's Sarcoma.

WO 93/17706 describes a method for vaccinating an animal
against a virus, wherein carrier particles were coated with a gene
construct and the coated particles were accelerated into cells of an animal.

The instant invention contemplates any of the known 
methods for introducing polynucleotides into living tissue to induce
expression of proteins. However, this invention provides a novel
immunogen for introducing proteins into the antigen processing 
pathway to efficiently generate specific CTLs and antibodies. 

Codon Usage and Codon Context 
The codon pairings of organisms are highly nonrandom,
and differ from organism to organism. This information is used to
construct and express altered or synthetic genes having desired levels of
translational efficiency, to determine which regions in a genome are
protein coding regions, to introduce translational pause sites into
heterologous genes, and to ascertain relationship or ancestral origin of
nucleotide sequences.
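
As a hedged illustration of what is meant by codon pairings, the following Python sketch tallies adjacent codon pairs along a reading frame. The function names and the toy sequence are assumptions introduced here for illustration only and are not part of the disclosure; a real analysis would pool many coding sequences from one organism and compare the observed pair counts with those expected from the individual codon frequencies.

```python
from collections import Counter


def split_codons(seq):
    """Split an in-frame coding sequence into codons, dropping any trailing partial codon."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def codon_pair_counts(seq):
    """Count each adjacent pair (codon i, codon i+1) along the reading frame."""
    codons = split_codons(seq)
    return Counter(zip(codons, codons[1:]))


# Hypothetical toy sequence: ATG CTG CTG AAA CTG CTG TAA
toy_gene = "ATGCTGCTGAAACTGCTGTAA"
pair_counts = codon_pair_counts(toy_gene)
for (first, second), n in pair_counts.most_common():
    print(first, second, n)
```

In this toy input the pair CTG-CTG occurs twice and every other pair once; nonrandomness in a real genome would show up as pairs that are consistently over- or under-represented across many genes.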

The expression of foreign heterologous genes in
transformed organisms is now commonplace. A large number of
mammalian genes, including, for example, murine and human genes,
have been successfully inserted into single celled organisms. Standard
techniques in this regard include introduction of the foreign gene to be
expressed into a vector such as a plasmid or a phage and utilizing that
vector to insert the gene into an organism. The native promoters for
such genes are commonly replaced with strong promoters compatible
with the host into which the gene is inserted. Protein sequencing
machinery permits elucidation of the amino acid sequences of even
minute quantities of native protein. From these amino acid sequences,
DNA sequences coding for those proteins can be inferred. DNA
synthesis is also a rapidly developing art, and synthetic genes
corresponding to those inferred DNA sequences can be readily
constructed.

Despite the burgeoning knowledge of expression systems 
and recombinant DNA, significant obstacles remain when one attempts
to express a foreign or synthetic gene in an organism. Many native,
active proteins, for example, are glycosylated in a manner different 
from that which occurs when they are expressed in a foreign host. For 
this reason, eukaryotic hosts such as yeast may be preferred to bacterial 
hosts for expressing many mammalian genes. The glycosylation
problem is the subject of continuing research.

Another problem is more poorly understood. Often 
translation of a synthetic gene, even when coupled with a strong 
promoter, proceeds much less efficiently than would be expected. The 
same is frequently true of exogenous genes foreign to the expression
organism. Even when the gene is transcribed in a sufficiently efficient
manner that recoverable quantities of the translation product are 
produced, the protein is often inactive or otherwise different in 
properties from the native protein. 

It is recognized that the latter problem is commonly due to
differences in protein folding in various organisms. The solution to this 
problem has been elusive, and the mechanisms controlling protein 
folding are poorly understood. 

The problems related to translational efficiency are
believed to be related to codon context effects. The protein coding
regions of genes in all organisms are subject to a wide variety of 
functional constraints, some of which depend on the requirement for 
encoding a properly functioning protein, as well as appropriate
translational start and stop signals. However, several features of protein
coding regions have been discerned which are not readily understood in 
terms of these constraints. Two important classes of such features are 
those involving codon usage and codon context. 

It is known that codon utilization is highly biased and varies
considerably between different organisms. Codon usage patterns have
been shown to be related to the relative abundance of tRNA 
isoacceptors. Genes encoding proteins of high versus low abundance 
show differences in their codon preferences. The possibility that biases 
in codon usage alter peptide elongation rates has been widely discussed.
While differences in codon use are associated with differences in
translation rates, direct effects of codon choice on translation have been
difficult to demonstrate. Other proposed constraints on codon usage 
patterns include maximizing the fidelity of translation and optimizing 
the kinetic efficiency of protein synthesis. 
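
To make the notion of biased codon utilization concrete, the sketch below computes codon frequencies per thousand codons for a single coding sequence. It is a minimal example under stated assumptions: the function name and the toy gene are hypothetical, and published usage tables are compiled from large sets of genes rather than one short sequence.

```python
from collections import Counter


def codon_frequencies_per_thousand(seq):
    """Return {codon: occurrences per 1000 codons} for an in-frame coding sequence."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    counts = Counter(seq[i:i + 3] for i in range(0, usable, 3))
    total = sum(counts.values())
    return {codon: 1000.0 * n / total for codon, n in counts.items()}


# Hypothetical toy gene: ATG CTG CTG TTA CTG AAA TAA
toy_gene = "ATGCTGCTGTTACTGAAATAA"
freqs = codon_frequencies_per_thousand(toy_gene)
for codon in sorted(freqs, key=freqs.get, reverse=True):
    print(codon, round(freqs[codon], 1))
```

Comparing such tables between organisms, or between highly and weakly expressed genes of one organism, is one simple way to observe the differences in codon preference described above.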

Apart from the non-random use of codons, considerable
evidence has accumulated that codon/anticodon recognition is influenced
by sequences outside the codon itself, a phenomenon termed "codon 
context." There exists a strong influence of nearby nucleotides on the 
efficiency of suppression of nonsense codons as well as missense codons.
Clearly, the abundance of suppressor activity in natural bacterial
populations, as well as the use of "termination" codons to encode 
selenocysteine and phosphoserine require that termination be context- 
dependent. Similar context effects have been shown to influence the 
fidelity of translation, as well as the efficiency of translation initiation. 

Statistical analyses of protein coding regions of E. coli have
demonstrated another manifestation of "codon context." The presence of
a particular codon at one position strongly influences the frequency of
occurrence of certain nucleotides in neighboring codons, and these
context constraints differ markedly for genes expressed at high versus
low levels. Although the context effect has been recognized, the 
predictive value of the statistical rules relating to preferred nucleotides 
adjacent to codons is relatively low. This has limited the utility of such 
nucleotide preference data for selecting codons to effect desired levels
of translational efficiency.

The advent of automated nucleotide sequencing equipment 
has made available large quantities of sequence data for a wide variety 
of organisms. Understanding those data presents substantial difficulties. 
For example, it is important to identify the coding regions of the
genome in order to relate the genetic sequence data to protein
sequences. In addition, the ancestry of the genome of certain organisms
is of substantial interest. It is known that genomes of some organisms 
are of mixed ancestry. Some sequences that are viral in origin are now 
stably incorporated into the genome of eukaryotic organisms. The viral
sequences themselves may have originated in another substantially
unrelated species. An understanding of the ancestry of a gene can be 
important in drawing proper analogies between related genes and their 
translation products in other organisms. 

There is a need for a better understanding of codon context
effects on translation, and for a method for determining the appropriate
codons for any desired translational effect. There is also a need for a
method for identifying coding regions of the genome from nucleotide
sequence data. There is also a need for a method for controlling protein
folding and for ensuring that the product of a foreign gene will fold
appropriately when expressed in a host. Genes altered or constructed in
accordance with desired translational efficiencies would be of
significant worth.

Another aspect of the practice of recombinant DNA
techniques for the expression by microorganisms of proteins of
industrial and pharmaceutical interest is the phenomenon of "codon
preference". While it was earlier noted that the existing machinery for
gene expression in genetically transformed host cells will "operate" to
construct a given desired product, levels of expression attained in a
microorganism can be subject to wide variation, depending in part on
specific alternative forms of the amino acid-specifying genetic code
present in an inserted exogenous gene. A "triplet" codon of four
possible nucleotide bases can exist in 64 variant forms. That these
forms provide the message for only 20 different amino acids (as well as
translation initiation and termination) means that some amino acids
can be coded for by more than one codon. Indeed, some amino acids
have as many as six "redundant", alternative codons while some others
have a single, required codon. For reasons not completely understood,
alternative codons are not at all uniformly present in the endogenous
DNA of differing types of cells and there appears to exist a variable
natural hierarchy or "preference" for certain codons in certain types of
cells.
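
The arithmetic behind this redundancy can be checked with a short Python sketch. It builds the standard genetic code from its conventional T, C, A, G ordering and counts how many codons specify each amino acid. The variable names are assumptions chosen for this illustration; only the standard code itself is taken as given.

```python
from collections import Counter
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 4 ** 3 = 64 DNA triplets to its amino acid (or stop).
CODON_TABLE = {
    "".join(triplet): aa
    for triplet, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Number of synonymous codons per amino acid.
degeneracy = Counter(CODON_TABLE.values())

print(len(CODON_TABLE))                              # total number of triplets
print(sorted(degeneracy.items(), key=lambda kv: -kv[1]))
```

Leucine (L), serine (S) and arginine (R) each have six codons, whereas methionine (M) and tryptophan (W) each have a single codon, which matches the range of redundancy described in the text.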

As one example, the amino acid leucine is specified by any
of six DNA codons: CTA, CTC, CTG, CTT, TTA, and TTG
(which correspond, respectively, to the mRNA codons CUA, CUC,
CUG, CUU, UUA and UUG). Exhaustive analysis of genome codon
frequencies for microorganisms has revealed that endogenous DNA of
E. coli most commonly contains the CTG leucine-specifying codon, while
the DNA of yeasts and slime molds most commonly includes a TTA
leucine-specifying codon. In view of this hierarchy, it is generally held
that the likelihood of obtaining high levels of expression of a
leucine-rich polypeptide by an E. coli host will depend to some extent on the
frequency of codon use. For example, a gene rich in TTA codons will
in all probability be poorly expressed in E. coli, whereas a CTG rich
gene will probably be highly expressed. Similarly, when
yeast cells are the projected transformation host cells for expression of a
leucine-rich polypeptide, a preferred codon for use in an inserted DNA
would be TTA.
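
The leucine example can be sketched in Python as a simple synonymous recoding step. This is an illustration only, assuming the host preferences stated in the text (CTG for E. coli, TTA for yeast); the function name, the host labels and the toy gene are hypothetical, and the sketch is not the optimization procedure used to produce the sequences shown in the figures.

```python
# The six DNA codons for leucine, as listed in the text.
LEUCINE_CODONS = {"CTA", "CTC", "CTG", "CTT", "TTA", "TTG"}

# Preferred leucine codon per host, taken from the statements in the text.
PREFERRED_LEUCINE = {"E. coli": "CTG", "yeast": "TTA"}


def recode_leucine(seq, host):
    """Replace every leucine codon with the host's preferred leucine codon.

    The encoded amino acid sequence is unchanged because all six codons
    are synonymous.
    """
    preferred = PREFERRED_LEUCINE[host]
    seq = seq.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    return "".join(preferred if codon in LEUCINE_CODONS else codon for codon in codons)


# Hypothetical toy gene: ATG TTA TTG CTT AAA TAA
toy_gene = "ATGTTATTGCTTAAATAA"
print(recode_leucine(toy_gene, "E. coli"))
print(recode_leucine(toy_gene, "yeast"))
```

A full synthetic gene design would apply the same idea to every amino acid using a complete host codon usage table, and would also have to respect constraints such as restriction sites that this sketch ignores.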

The implications of codon preference phenomena on
recombinant DNA techniques are manifest, and the phenomenon may
serve to explain many prior failures to achieve high expression levels of
exogenous genes in successfully transformed host organisms: a less
"preferred" codon may be repeatedly present in the inserted gene and
the host cell machinery for expression may not operate as efficiently.
This phenomenon suggests that synthetic genes which have been
designed to include a projected host cell's preferred codons provide a
preferred form of foreign genetic material for practice of recombinant
DNA techniques.

Protein Trafficking

The diversity of function that typifies eukaryotic cells 
depends upon the structural differentiation of their membrane 
boundaries. To generate and maintain these structures, proteins must be 
transported from their site of synthesis in the endoplasmic reticulum to
predetermined destinations throughout the cell. This requires that the
trafficking proteins display sorting signals that are recognized by the 
molecular machinery responsible for route selection located at the 
access points to the main trafficking pathways. Sorting decisions for 
most proteins need to be made only once as they traverse their
biosynthetic pathways since their final destination, the cellular location
at which they perform their function, becomes their permanent 
residence. 

Maintenance of intracellular integrity depends in part on
the selective sorting and accurate transport of proteins to their correct
destinations. Over the past few years the molecular machinery for
targeting and localization of proteins has been studied
vigorously. Defined sequence motifs have been identified on proteins
which can act as 'address labels'. A number of sorting signals have been
found associated with the cytoplasmic domains of membrane proteins.

SUMMARY OF THE INVENTION 

This invention relates to novel formulations of nucleic acid 
pharmaceutical products, specifically nucleic acid vaccine products. 
The nucleic acid products, when introduced directly into muscle cells,
induce the production of immune responses which specifically recognize
Hepatitis C virus (HCV).

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows the nucleotide sequence of the V1Ra vector.
Figure 2 is a diagram of the V1Ra vector.
Figure 3 is a diagram of the Vtpa vector.
Figure 4 is the VUb vector.
Figure 5 shows an optimized sequence of the HCV core antigen.
Figure 6 shows V1Ra.HCV1CorePAb, Vtpa.HCV1CorePAb and VUb.HCV1CorePAb.
Figure 7 shows the Hepatitis C Virus Core Antigen Sequence.
Figure 8 shows codon utilization in human protein-coding sequences (from Lathe et al.).
Figure 9 shows an optimized sequence of the HCV E1 protein.
Figure 10 shows an optimized sequence of the HCV E2 protein.
Figure 11 shows an optimized sequence of the HCV E1+E2 proteins.
Figure 12 shows an optimized sequence of the HCV NS5a protein.
Figure 13 shows an optimized sequence of the HCV NS5b protein.

DETAILED DESCRIPTION OF THE INVENTION 

This invention relates to novel formulations of nucleic acid
pharmaceutical products, specifically nucleic acid vaccine products.
The nucleic acid vaccine products, when introduced directly into muscle
cells, induce the production of immune responses which specifically 
recognize Hepatitis C virus (HCV). 

Non-A, Non-B hepatitis (NANBH) is a transmissible disease
(or family of diseases) that is believed to be virally induced, and is 
distinguishable from other forms of virus-associated liver disease, such 
as those caused by hepatitis A virus (HAV), hepatitis B virus (HBV), 
delta hepatitis virus (HDV), cytomegalovirus (CMV) or Epstein-Barr
virus (EBV). Epidemiologic evidence suggests that there may be three 
types of NANBH: the water-borne epidemic type; the blood or needle 
associated type; and the sporadically occurring (community acquired) 
type. However, the number of causative agents is unknown. Recently, a
new viral species, hepatitis C virus (HCV) has been identified as the
primary (if not only) cause of blood-associated NANBH (BB-NANBH). 
Hepatitis C appears to be the major form of transfusion-associated 
hepatitis in a number of countries, including the United States and 
Japan. There is also evidence implicating HCV in induction of
hepatocellular carcinoma. Thus, a need exists for an effective method
for preventing or treating HCV infection: currently, there is none. 

The HCV may be distantly related to the Flaviviridae. The
Flavivirus family contains a large number of viruses which are small,
enveloped pathogens of man. The morphology and composition of
Flavivirus particles are known, and are discussed in M. A. Brinton, in
"The Viruses: The Togaviridae And Flaviviridae" (Series eds.
Fraenkel-Conrat and Wagner, vol. eds. Schlesinger and Schlesinger, Plenum
Press, 1986), pp. 327-374. Generally, with respect to morphology,
Flaviviruses contain a central nucleocapsid surrounded by a lipid
bilayer. Virions are spherical and have a diameter of about 40-50 nm.
Their cores are about 25-30 nm in diameter. Along the outer surface of
the virion envelope are projections measuring about 5-10 nm in length
with terminal knobs about 2 nm in diameter. Typical examples of the
family include Yellow Fever virus, West Nile virus, and Dengue Fever
virus. They possess positive-stranded RNA genomes (about 11,000
nucleotides) that are slightly larger than that of HCV and encode a
polyprotein precursor of about 3500 amino acids. Individual viral
proteins are cleaved from this precursor polypeptide.

The genome of HCV appears to be single-stranded RNA
containing about 10,000 nucleotides. The genome is positive-stranded,
and possesses a continuous translational open reading frame (ORF) that
encodes a polyprotein of about 3,000 amino acids. In the ORF, the
structural proteins appear to be encoded in approximately the first
quarter of the N-terminal region, with the majority of the polyprotein
attributed to non-structural proteins. When compared with all known
viral sequences, small but significant co-linear homologies are observed
with the nonstructural proteins of the Flavivirus family, and with the
pestiviruses (which are now also considered to be part of the Flavivirus
family).

Intramuscular inoculation of polynucleotide constructs, i.e.,
DNA plasmids encoding proteins, has been shown to result in the
generation of the encoded protein in situ in muscle cells. By using
cDNA plasmids encoding viral proteins, both antibody and CTL
responses were generated, providing homologous and heterologous
protection against subsequent challenge with either the homologous
strain or a heterologous strain, respectively. Each of these types of immune
responses offers a potential advantage over existing vaccination
strategies. The use of PNVs (polynucleotide vaccines) to generate
antibodies may result in an increased duration of the antibody responses
as well as the provision of an antigen that can have both the exact
sequence of the clinically circulating strain of virus and the
proper post-translational modifications and conformation of the native
protein (vs. a recombinant protein). The generation of CTL responses
by this means offers the benefits of cross-strain protection without the
use of a live, potentially pathogenic vector or attenuated virus.

The standard techniques of molecular biology for
preparing and purifying DNA constructs enable the preparation of the
DNA therapeutics of this invention. While standard techniques of
molecular biology are therefore sufficient for the production of the
products of this invention, the specific constructs disclosed herein
provide novel therapeutics which surprisingly produce cross-strain
protection, a result heretofore unattainable with standard inactivated
whole virus or subunit protein vaccines.

The amount of expressible DNA to be introduced to a
vaccine recipient will depend on the strength of the transcriptional and
translational promoters used in the DNA construct, and on the
immunogenicity of the expressed gene product. In general, an
immunologically or prophylactically effective dose of about 1 µg to 1
mg, and preferably about 10 µg to 300 µg, is administered directly into
muscle tissue. Subcutaneous injection, intradermal introduction,
impression through the skin, and other modes of administration such as
intraperitoneal, intravenous, or inhalation delivery are also
contemplated. It is also contemplated that booster vaccinations are to be
provided.

The DNA may be naked, that is, unassociated with any 
proteins, adjuvants or other agents which impact on the recipient's 
immune system. In this case, it is desirable for the DNA to be in a 
physiologically acceptable solution, such as, but not limited to, sterile 
saline or sterile buffered saline. Alternatively, the DNA may be 
associated with surfactants or liposomes, such as lecithin liposomes or 
other liposomes known in the art, as a DNA-liposome mixture (see for 
example WO93/24640), or the DNA may be associated with an adjuvant 
known in the art to boost immune responses, such as a protein or other 
carrier. Agents which assist in the cellular uptake of DNA, such as, but 
not limited to, calcium ions, detergents, viral proteins and other 
transfection facilitating agents may also be used to advantage. These 
agents are generally referred to as transfection facilitating agents and as 
pharmaceutically acceptable carriers. As used herein, the term gene 
refers to a segment of nucleic acid which encodes a discrete polypeptide. 
The terms pharmaceutical and vaccine are used interchangeably to 
indicate compositions useful for inducing immune responses. The terms 
construct and plasmid are used interchangeably. The term vector is 
used to indicate a DNA into which genes may be cloned for use 
according to the method of this invention. 

The following examples are provided to further define the 
invention, without limiting the invention to the specifics of the 
examples. 

EXAMPLE 1 

V1J EXPRESSION VECTORS: 

V1J is derived from vectors V1 and pUC18, a 
commercially available plasmid. V1 was digested with SspI and EcoRI 
restriction enzymes, producing two fragments of DNA. The smaller of 
these fragments, containing the CMVintA promoter and Bovine Growth 
Hormone (BGH) transcription termination elements which control the 
expression of heterologous genes, was purified from an agarose 
electrophoresis gel. The ends of this DNA fragment were then 
"blunted" using the T4 DNA polymerase enzyme in order to facilitate 
its ligation to another "blunt-ended" DNA fragment. 

pUC18 was chosen to provide the "backbone" of the 
expression vector. It is known to produce high yields of plasmid, is 
well-characterized by sequence and function, and is of minimum size. 
We removed the entire lac operon from this vector, which was 
unnecessary for our purposes and may be detrimental to plasmid yields 
and heterologous gene expression, by partial digestion with the HaeII 
restriction enzyme. The remaining plasmid was purified from an 
agarose electrophoresis gel, blunt-ended with the T4 DNA polymerase, 
treated with calf intestinal alkaline phosphatase, and ligated to the 
CMVintA/BGH element described above. Plasmids exhibiting either of 
two possible orientations of the promoter elements within the pUC 
backbone were obtained. One of these plasmids gave much higher 
yields of DNA in E. coli and was designated V1J. This vector's 
structure was verified by sequence analysis of the junction regions and 
was subsequently demonstrated to give comparable or higher expression 
of heterologous genes compared with V1. The ampicillin resistance 
marker was replaced with the neomycin resistance marker to yield 
vector V1Jneo. 

An Sfi I site was added to V1Jneo to facilitate integration 
studies. A commercially available 13 base pair Sfi I linker (New 
England BioLabs) was added at the Kpn I site within the BGH sequence 
of the vector. V1Jneo was linearized with Kpn I, gel purified, blunted 
by T4 DNA polymerase, and ligated to the blunt Sfi I linker. Clonal 
isolates were chosen by restriction mapping and verified by sequencing 
through the linker. The new vector was designated V1Jns. Expression 
of heterologous genes in V1Jns (with Sfi I) was comparable to 
expression of the same genes in V1Jneo (with Kpn I). 

Vector V1Ra (sequence is shown in Figure 1; map is shown 
in Figure 2) was derived from vector V1R, a derivative of the V1Jns 
vector. Multiple cloning sites (BglII, KpnI, EcoRV, EcoRI, SalI, and 
NotI) were introduced into V1R to create the V1Ra vector to improve 
the convenience of subcloning. V1Ra vector derivatives containing the 
tpa leader sequence and ubiquitin sequence were generated (Vtpa 
(Figure 3) and Vub (Figure 4), respectively). Expression of viral 
antigen from the Vtpa vector will target the antigen protein into the 
exocytic pathway, thus producing a secretable form of the antigen 
protein. These secreted proteins are likely to be captured by 
professional antigen presenting cells, such as macrophages and dendritic 
cells, and processed and presented by class II molecules to activate CD4+ 
Th cells. They also are more likely to efficiently stimulate antibody 
responses. Expression of viral antigen through the VUb vector will 
produce a ubiquitin and antigen fusion protein. The uncleavable 
ubiquitin segment (glycine to alanine change at the cleavage site, Butt et 
al., JBC 263:16364, 1988) will target the viral antigen to ubiquitin-associated 
proteasomes for rapid degradation. The resulting peptide 
fragments will be transported into the ER for antigen presentation by 
class I molecules. This modification is intended to enhance the class I 
molecule-restricted CTL responses against the viral antigen (Townsend 
et al., JEM 168:1211, 1988). 




EXAMPLE 2 

DESIGN AND CONSTRUCTION OF THE SYNTHETIC GENES 

A. Design of Synthetic Gene Segments for HCV Gene Expression: 

Gene segments were converted to sequences having 
identical translated sequences (except where noted) but with alternative 
codon usage as defined by R. Lathe in a research article from J. Molec. 
Biol. Vol. 183, pp. 1-12 (1985) entitled "Synthetic Oligonucleotide 
Probes Deduced from Amino Acid Sequence Data: Theoretical and 
Practical Considerations". The methodology described below was based 
on our hypothesis that the known inability to express a gene efficiently 
in mammalian cells is a consequence of the overall transcript 
composition. Thus, using alternative codons encoding the same protein 
sequence may remove the constraints on HCV gene expression. 
Inspection of the codon usage within the HCV genome revealed that a high 
percentage of codons were among those infrequently used by highly 
expressed human genes. The specific codon replacement method 
employed may be described as follows, employing data from Lathe et 
al.: 

1. Identify placement of codons for proper open 
reading frame. 

2. Compare wild type codon for observed frequency of 
use by human genes (refer to Table 3 in Lathe et al.). 

3. If codon is not the most commonly employed, 
replace it with an optimal codon for high expression based on data in 
Table 5. 

4. Inspect the third nucleotide of the new codon and the 
first nucleotide of the adjacent codon immediately 3' of the first. If a 
5'-CG-3' pairing has been created by the new codon selection, replace it 
with the choice indicated in Table 5. 

5. Repeat this procedure until the entire gene segment 
has been replaced. 

6. Inspect new gene sequence for undesired sequences 
generated by these codon replacements (e.g., "ATTTA" sequences, 
inadvertent creation of intron splice recognition sites, unwanted 
restriction enzyme sites, etc.) and substitute codons that eliminate these 
sequences. 

7. Assemble synthetic gene segments and test for 
improved expression. 
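
By way of illustration only, steps 1 through 6 of the codon replacement 
method may be sketched in Python as follows. This is a minimal sketch 
under stated assumptions, not the procedure actually used to build the 
genes disclosed herein. The maps PREFERRED and SECOND_CHOICE are 
hypothetical stand-ins for Tables 3 and 5 of Lathe and cover only three 
amino acids; the function names, and the rule of reverting to the wild 
type codon when no second choice is listed, are likewise assumptions. 
Sequences are written as DNA (T in place of U). 

```python
from itertools import product

# Standard genetic code, built in TCAG order over the DNA alphabet.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Hypothetical preferred codons (assumption): the most frequent codon for
# F, L and I in the Table 3 fragment; other amino acids are left unchanged.
PREFERRED = {"F": "TTC", "L": "CTG", "I": "ATC"}

# Hypothetical replacements (assumption) used when a preferred codon would
# create a 5'-CG-3' junction; they stand in for the Table 5 choices.
SECOND_CHOICE = {"F": "TTT", "I": "ATT"}


def split_codons(seq):
    """Step 1: split an in-frame coding sequence into codons."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def translate(seq):
    """Translate an in-frame DNA sequence to one-letter amino acids."""
    return "".join(CODON_TABLE[c] for c in split_codons(seq))


def optimize(seq, preferred=PREFERRED, second_choice=SECOND_CHOICE):
    """Steps 2 to 5: replace codons, then repair newly created CG junctions."""
    wild = split_codons(seq.upper())
    # Steps 2, 3 and 5: swap every codon for the preferred synonym, if any.
    new = [preferred.get(CODON_TABLE[c], c) for c in wild]
    # Step 4: a changed codon ending in C followed by a codon starting
    # with G creates a CG pair; fall back to the second choice, or to
    # the wild type codon when no second choice is listed.
    for i in range(len(new) - 1):
        if new[i] != wild[i] and new[i][2] == "C" and new[i + 1][0] == "G":
            new[i] = second_choice.get(CODON_TABLE[wild[i]], wild[i])
    return "".join(new)


def find_motifs(seq, motifs=("ATTTA",)):
    """Step 6: report (motif, 0-based position) for each undesired motif."""
    hits = []
    for motif in motifs:
        start = seq.find(motif)
        while start != -1:
            hits.append((motif, start))
            start = seq.find(motif, start + 1)
    return hits


if __name__ == "__main__":
    wild_type = "TTTATAGGCATG"  # encodes F-I-G-M
    synthetic = optimize(wild_type)
    print(synthetic, translate(synthetic) == translate(wild_type))
    print(find_motifs(synthetic, motifs=("ATTTA", "GAATTC")))
```

In this sketch the isoleucine codon ATA is first replaced by ATC, which 
would create a CG pair with the following GGC codon, so the second 
choice ATT is used instead. Restriction sites to be avoided, such as 
the EcoRI site GAATTC, can be passed to find_motifs alongside "ATTTA". 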

B. HCV CORE ANTIGEN SEQUENCE 

The consensus core sequence of HCV was adopted from a 
generalized core sequence reported by Bukh et al. (PNAS, 91:8239, 
1994). This core sequence contains all the identified CTL epitopes in 
both human and mouse. The gene is composed of 573 nucleotides and 
encodes 191 amino acids. The predicted molecular weight is about 23 
kDa. 

The codon replacement was conducted to eliminate codons 
which may hinder the expression of the HCV core protein in transfected 
mammalian cells in order to maximize the translational efficiency of 
the DNA vaccine. Twenty-three point two percent (23.2%) of the nucleotide 
sequence (133 out of 573 nucleotides) was altered, resulting in changes 
to 61.3% of the codons (117 out of 191 codons) in the core antigen 
sequence. The optimized nucleotide sequence of HCV core is shown in 
Figure 5. 
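
The two percentages reported above follow directly from the stated 
counts, and the same figures can be computed for any pair of wild type 
and codon-replaced sequences. The following Python sketch is offered 
only as an illustration; the function names pct and change_summary and 
the short test sequences are hypothetical and are not taken from the 
figures. 

```python
def pct(part, whole):
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100 * part / whole, 1)


def change_summary(wild, optimized):
    """Count nucleotide and codon differences between two in-frame sequences."""
    if len(wild) != len(optimized) or len(wild) % 3 != 0:
        raise ValueError("sequences must be in frame and of equal length")
    # Positions at which the two sequences differ.
    nt_changed = sum(a != b for a, b in zip(wild, optimized))
    # Codons containing at least one differing position.
    codons_changed = sum(
        wild[i:i + 3] != optimized[i:i + 3] for i in range(0, len(wild), 3)
    )
    return {
        "nt_changed": nt_changed,
        "nt_pct": pct(nt_changed, len(wild)),
        "codons_changed": codons_changed,
        "codon_pct": pct(codons_changed, len(wild) // 3),
    }


if __name__ == "__main__":
    # Counts reported in the text for the core gene.
    print(pct(133, 573), pct(117, 191))
    # Hypothetical 12-nucleotide example.
    print(change_summary("TTTATAGGCATG", "TTCATTGGCATG"))
```

Because most replacements alter only the third position of a codon, the 
fraction of codons changed is much larger than the fraction of 
nucleotides changed. 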

C. CONSTRUCTION OF THE SYNTHETIC CORE GENE 

The optimized HCV core gene (Figure 5) was constructed 
as a synthetic gene annealed from multiple synthetic oligonucleotides. 
To facilitate the identification and evaluation of the synthetic gene 
expression in cell culture and its immunogenicity in mice, a CTL 
epitope derived from influenza virus nucleoprotein residues 366-374 
and an antibody epitope sequence derived from SV40 T antigen residues 
684-698 were tagged to the carboxyl terminus of the core sequence 
(Figure 6). For clinical use it may be desired to express the core 
sequence without the nucleoprotein 366-374 and SV40 T 684-698 
sequences. For this reason, the sequence of the two epitopes is flanked 
by two EcoRI sites which will be used to excise this fragment of 
sequence at a later time. Thus an embodiment of the invention for 
clinical use could consist of the V1Ra.HCV1CorePAb, 
Vtpa.HCV1CorePAb, or VUb.HCV1CorePAb plasmids that had been 
cut with EcoRI, annealed, and ligated to yield plasmids 
V1Ra.HCV1Core, Vtpa.HCV1Core, and VUb.HCV1Core. 

The synthetic gene was built as three separate segments in 
three vectors: nucleotides 1 to 80 in V1Ra, nucleotides 80 to 347 (BstXI 
site) in pUC18, and nucleotides 347 to 573 plus the two epitope 
sequences in pUC18. All the segments were verified by DNA 
sequencing, and joined together in the V1Ra vector. 

D. HCV Gene Expression Constructs: 

In each case, the junction sequences from the 5' promoter 
region (CMVintA) into the cloned gene are shown. The position at which 
the junction occurs is demarcated by a "/", which does not represent any 
discontinuity in the sequence. 

The nomenclature for these constructs follows the 
convention: "Vector name-HCV strain-gene". 

V1Ra.HCV1.CorePAb 
--IntA--AGA TCT ACC / ATG AGC--HCV.Core.--GCC / GAA TTC GCT TCC-- 
PAb Sequence--TAA / ACC CGG GAA TTC TAA A / GTC GAC--BGH-- 

Vtpa.HCV1.CorePAb 
--IntA--ATC ACC / ATG GAT--tpa leader--GAG ATC-TTC / ATG AGC-- 
HCV.Core.--GCC / GAA TTC GCT TCC--PAb Sequence--TAA / ACC CGG GAA 
TTC TAA A / GTC GAC--BGH-- 

VUb.HCV1.CorePAb 
--IntA--AGA TCC ACC / ATG CAG--Ubiquitin--GGT GCA GAT CTG / ATG AGC-- 
HCV.Core.--GCC / GAA TTC GCT TCC--PAb Sequence--TAA / ACC CGG GAA 
TTC TAA A / GTC GAC--BGH-- 

V1Ra.HCV1.Core 
--IntA--AGA TCT ACC / ATG AGC--HCV.Core.--GCC / TAA A / GTC GAC-- 
BGH-- 

Vtpa.HCV1.Core 
--IntA--ATC ACC / ATG GAT--tpa leader--GAG ATC-TTC / ATG AGC-- 
HCV.Core.--GCC / TAA A / GTC GAC--BGH-- 

VUb.HCV1.Core 
--IntA--AGA TCC ACC / ATG CAG--Ubiquitin--GGT GCA GAT CTG / ATG AGC-- 
HCV.Core.--GCC / TAA A / GTC GAC--BGH-- 

E. OTHER SYNTHETIC HCV GENES 

Using similar codon optimization techniques, synthetic 
genes encoding the HCV E1 (Figure 9), HCV E2 (Figure 10), HCV 
E1+E2 (Figure 11), HCV NS5a (Figure 12) and HCV NS5b (Figure 13) 
proteins were created. 

WHAT IS CLAIMED: 



1. A synthetic polynucleotide comprising a DNA 
sequence encoding an HCV protein selected from the group consisting 
of HCV core protein, HCV E1 protein, HCV E1+E2 protein, HCV NS5a 
protein, HCV NS5b protein and fragments thereof, the DNA sequence 
comprising codons optimized for expression in a vertebrate host. 



2. A plasmid vector comprising the polynucleotide of 
Claim 1, the plasmid vector being suitable for immunization of a 
vertebrate host. 






3. The polynucleotide of Claim 1 which is HCV 
genotype I/Ia core. 



4. The polynucleotide of Claim 1 having the sequence 

[nucleotide sequence listing illegible in the source scan] 



5. The plasmid vector of Claim 2 having the sequence 

[nucleotide sequence listing illegible in the source scan] 



6. The polynucleotide of Claim 4 from which the PAb 
sequence has been removed. 



7. The plasmid vector of Claim 5 from which the PAb 
sequence has been removed. 

8. A method for inducing immune responses in a 
vertebrate against HCV epitopes which comprises introducing between 1 
ng and 100 mg of the polynucleotide of Claim 1 into the tissue of the 
vertebrate. 

9. A method for inducing immune responses against 
infection or disease caused by HCV which comprises introducing into 
the tissue of a vertebrate the polynucleotide of Claim 1. 

10. A vaccine for inducing immune responses against 
HCV infection which comprises the polynucleotide of Claim 1 and a 
pharmaceutically acceptable carrier. 

11. A method for inducing anti-HCV immune responses 
in a primate which comprises introducing the polynucleotide of Claim 1 
into the tissue of said primate and concurrently administering 
interleukin-12 parenterally. 

12. A method of inducing an antigen presenting cell to 
stimulate cytotoxic and helper T-cell proliferation and effector functions 
including lymphokine secretion specific to HCV antigens which 
comprises exposing cells of a vertebrate in vivo to the polynucleotide of 
Claim 1. 

13. A method of treating a patient in need of such 
treatment comprising administering to the patient the polynucleotide of 
Claim 1 in combination with interferon-alpha, Ribavirin, Zidovudine, 
or other pharmaceutically acceptable antiviral agents. 

14. A pharmaceutical composition comprising the 
polynucleotide of Claim 1. 

15. A method of inducing an immune response 
comprising administering the polynucleotide of Claim 1 to a patient, the 
administration of the polynucleotide antedating or coinciding or 
following administration to the patient of a subunit, recombinant, 
recombinant live vector, inactivated, recombinant inactivated vector, or 
live attenuated HCV vaccine. 

16. A method for inducing immune responses in a 
vertebrate against HCV epitopes which comprises introducing between 1 
ng and 100 mg of the polynucleotide of Claim 2 into the tissue of the 
vertebrate. 

17. A method for inducing immune responses against 
infection or disease caused by HCV which comprises introducing into 
the tissue of a vertebrate the polynucleotide of Claim 2. 

18. A vaccine for inducing immune responses against 
HCV infection which comprises the polynucleotide of Claim 2 and a 
pharmaceutically acceptable carrier. 

19. A method for inducing anti-HCV immune responses 
in a primate which comprises introducing the polynucleotide of Claim 2 
into the tissue of said primate and concurrently administering 
interleukin-12 parenterally. 

20. A method of inducing an antigen presenting cell to 
stimulate cytotoxic and helper T-cell proliferation and effector functions 
including lymphokine secretion specific to HCV antigens which 
comprises exposing cells of a vertebrate in vivo to the polynucleotide of 
Claim 2. 

21. A method of treating a patient in need of such 
treatment comprising administering to the patient the polynucleotide of 
Claim 2 in combination with interferon-alpha, Ribavirin, Zidovudine, 
or other pharmaceutically acceptable antiviral agents. 

22. A pharmaceutical composition comprising the 
polynucleotide of Claim 2. 

23. A method of inducing an immune response 
comprising administering the polynucleotide of Claim 2 to a patient, the 
administration of the polynucleotide antedating or coinciding or 
following administration to the patient of a subunit, recombinant, 
recombinant live vector, inactivated, recombinant inactivated vector, or 
live attenuated HCV vaccine. 

24. The vector of Claim 2 which is selected from 
V1Ra.HCV1CorePAb, Vtpa.HCV1CorePAb, VUb.HCV1CorePAb, 
V1Ra.HCV1Core, Vtpa.HCV1Core and VUb.HCV1Core. 



25. A pharmaceutical composition comprising the vector 

of Claim 21. 

26. The DNA sequence of Claim 1 selected from the 
group consisting of a nucleotide sequence shown in Figure 5, Figure 9, 
Figure 10, Figure 11, Figure 12 and Figure 13. 

TABLE 3 

CODON UTILIZATION IN HUMAN PROTEIN-CODING SEQUENCES 

                 f

F   UUU    68   0.35
    UUC   125   0.65

L   UUA    20   0.05
    UUG    42   0.09
    CUU    50   0.11
    CUC    99   0.22
    CUA    30   0.07
    CUG   204   0.46

I   AUU    28   0.23
    AUC    79   0.64
    AUA    16   0.13

M   AUG    77   1.00

V   GUU    35   0.13
    GUC    72   0.27
    GUA    25   0.09